Decentralized provenance-aware publishing with nanopublications
Authors
Abstract
Publication and archival of scientific results is still commonly considered the responsibility of classical publishing companies. Classical forms of publishing, however, which center around printed narrative articles, no longer seem well-suited to the digital age. In particular, there are currently no efficient, reliable, and agreed-upon methods for publishing scientific datasets, which have become increasingly important for science. In this article, we propose to design scientific data publishing as a web-based bottom-up process, without top-down control by central authorities such as publishing companies. Based on a novel combination of existing concepts and technologies, we present a server network that stores and archives data in a decentralized manner in the form of nanopublications, an RDF-based format for representing scientific data. We show how this approach allows researchers to publish, retrieve, verify, and recombine datasets of nanopublications in a reliable and trustworthy manner, and we argue that this architecture could be used as a low-level data publication layer to serve the Semantic Web in general. Our evaluation of the current network shows that this system is efficient and reliable.
Subjects: Bioinformatics, Computer Networks and Communications, Digital Libraries, World Wide Web and Web Science
Similar resources
nanopub-java: A Java Library for Nanopublications
The concept of nanopublications was first proposed about six years ago, but it lacked openly available implementations. The library presented here is the first one that has become an official implementation of the nanopublication community. Its core features are stable, but it also contains unofficial and experimental extensions: for publishing to a decentralized server network, for defining se...
Publishing DisGeNET as nanopublications
The increasing and unprecedented publication rate in the biomedical field is a major bottleneck for knowledge discovery in the Life Sciences. The manual curation of facts from published scientific papers is slow and inefficient, and therefore new approaches are needed that can enable the automatic, scalable and reliable extraction of assertions. While the publication of scientific assertions an...
Reliable Granular References to Changing Linked Data
Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublica...
Genome Annotation using Nanopublications: An Approach to Interoperability of Genetic Data
With the widespread use of Next Generation Sequencing (NGS) technologies, the primary bottleneck of genetic research has shifted from data production to data analysis. However, annotated datasets produced by different research groups are often in different formats, making genomic comparisons and integration with other datasets challenging and time consuming tasks. Here, we propose a new data in...
Using Nanopublications to Incentivize the Semantic Exposure of Life Science Information
The growing rate of data production in the life sciences creates an urgent need for semantic integration of information. Although the development of tools and infrastructure will make semantic data exposure easier with time, presently the effort associated with creating linked data remains largely unrecognized by peer-review processes, publishers, and promotion committees. Here, we describe a n...
Journal title:
- PeerJ Computer Science
Volume 2
Publication date: 2016